Multilingual Sentence Generation
نویسندگان
چکیده
This paper presents an overview of a robust, broad-coverage, and application-independent natural language generation system. It demonstrates how the different language generation components function within a multilingual Machine Translation (MT) system, using the languages that we are currently working on (English, Spanish, Japanese, and Chinese). Section 1 provides a system description. Section 2 focuses on the generation components and their core set of rules. Section 3 describes an additional layer of generation rules included to address applicationspecific issues. Section 4 provides a brief description of the evaluation method and results for the MT system of which our generation components are a part. 1 System Description We present a natural language generation method in the context of a multi-lingual MT system. The system that we have been developing is a hybrid system with rule-based, example-based, and statistical components. Analysis and generation are performed with linguistic parsers and syntactic realization modules, the rules of which are coded by hand. Transfer is accomplished using transfer rules/mappings automatically extracted from aligned corpora. The MT process starts with a source sentence being analyzed by the source-language parser, which produces as output a syntactic tree. This tree is input to the Logical Form module, which produces a deep syntactic representation of the input sentence, called the LF (Heidorn, G. E., 2000). The LF uses the same basic set of relation types for all languages. Figure 1 gives the syntactic tree and LF for the simple English sentence, “I gave the pencils to John”.
منابع مشابه
Génération de phrases multilingues par apprentissage automatique de modèles de phrases. (Multilingual Natural Language Generation using sentence models learned from corpora)
Multilingual Natural Language Generation using sentence models learned from corpora Natural Language Generation (NLG) is the natural language processing task of generating natural language from a machine representation system. In this thesis report, we present an architecture of NLG system relying on statistical methods. The originality of our proposition is its ability to use a corpus as a lea...
متن کاملBroad Coverage Multilingual Deep Sentence Generation with a Stochastic Multi-Level Realizer
Most of the known stochastic sentence generators use syntactically annotated corpora, performing the projection to the surface in one stage. However, in full-fledged text generation, sentence realization usually starts from semantic (predicate-argument) structures. To be able to deal with semantic structures, stochastic generators require semantically annotated, or, even better, multilevel anno...
متن کاملCross Lingual Query Dependent Snippet Generation
The present paper describes the development of a cross lingual query dependent snippet generation module. It is a language independent module, so it also performs as a multilingual snippet generation module. It is a module of the Cross Lingual Information Access (CLIA) system. This module takes the query and content of each retrieved document and generates a query dependent snippet for each ret...
متن کاملMetonymy: Reassessment, Survey of Acceptability, and its Treatment in a Machine Translation System
In this article we outline a basic approach to treating metonymy properly in a multilingual machine translation system. This is the first attempt at treating metonymy in an machine translation environment. The approach is guided by the differences of acceptability of metonymy which were obtained by our comparative survey among three languages, English, Chinese, and Japanese. The characteristics...
متن کاملDriving Multilingual Sentence Generation with Lexico-grammatical Resources
JOHN HIT using AGENT the thematic roles and HIT THEME. and MARY The input consists of the predicate and two arguments joined with the predicate
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001